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Artificial intelligence has potential for forecasting reactor conditions in the nuclear industry. Owing to eco- 
nomic and security concerns, a common method is to train data generated by simulators. However, achieving 
a satisfactory performance in practical applications is difficult because simulators imperfectly emulate reality. 
To bridge this gap, we propose a novel framework called simulation-to-reality domain adaptation (SRDA) for 
forecasting the operating parameters of nuclear reactors. The SRDA model employs a transformer-based fea- 
ture extractor to capture dynamic characteristics and temporal dependencies. A parameter predictor with an 
improved logarithmic loss function is specifically designed to adapt to varying reactor powers. To fuse prior 
reactor knowledge from simulations with reality, the domain discriminator utilizes an adversarial strategy to 
ensure the learning of deep domain-invariant features, and the multiple kernel maximum mean discrepancy 
minimizes their discrepancies. Experiments on neutron fluxes and temperatures from a pressurized water re- 
actor illustrate that the SRDA model surpasses various advanced methods in terms of predictive performance. 
This study is the first to use domain adaptation for real-world reactor prediction and presents a feasible solution 
for enhancing the transferability and generalizability of simulated data. 
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I. INTRODUCTION 


The advancement of nuclear energy and its safe use can 
provide an impetus for human progress and support the pro- 
duction of high-end equipment, energy security, a reduction 
of climate change, and a shift to greener energy sources [1, 2]. 
Increasing the autonomous operation of nuclear power plants 
(NPPs) through digitalization and intellectualization is crucial 
for enhancing the efficiency and safety of nuclear energy, as 
well as for lowering operation and maintenance costs [3, 4]. 
The phrase "unmanned surveillance, few people on duty" de- 
scribes the automation activities of NPPs in the future [5]. 


One of the pivotal technologies required to fulfill this desire 
is the prediction of key parameters in NPPs, especially un- 
der transient conditions, to perform timely decision-making 
and ensure early warning [6, 7]. The precise forecasting of 
key parameters in reactors is a well-established challenge. In 
recent years, considerable efforts have been made to predict 
various operating parameters in reactor transient conditions, 
including in-core power [8], outlet temperature [9], coolant 
leakage [10], and pressure [11]. Typical artificial intelligence 
(AI) models with multidimensional mappability, support vec- 
tor regression (SVR), artificial neural networks (ANNs), and 
long short-term memory (LSTM) are frequently utilized in 
operating-parameter forecasting [12-14]. For example, Zeng 
et al. [15] combined SVR with particle filtering to predict the 
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core power and coolant temperature of a reactor and achieved 
satisfactory accuracy for reactor reactivity insertion events. 
Lu et al. [16] developed an ANN-based model for forecast- 
ing the thermal-hydraulic parameters in a KLT-40S nuclear 
reactor under steady-state operation, and their results were in 
good agreement with the RELAPS5 simulation. 


For economic and security purposes, frequently conducting 
trials in real-world NPPs may trigger uncontrollable events 
and equipment damage. Therefore, obtaining a large num- 
ber of transient samples is extremely challenging. A vi- 
able alternative for addressing the scarcity of real data is 
to apply numerical simulations to address class-imbalance 
problems [17]. For example, Xiang et al. [18] proposed 
a gear-oriented fault-detection method to enlarge fault sam- 
ples by integrating a finite-element-method simulation and 
generative adversarial network, achieving satisfactory results. 
This idea is suitable for a mechanical system with a cor- 
responding well-constructed simulation. Additionally, stud- 
ies have utilized various programs that can perform system- 
level simulations (such as RELAPS [19], TRACE [20], PC- 
TRAN [21], CASMOS5 [22], and PANGU [23]) to produce 
simulated data for AIl-model training and verification based 
on nuclear-engineering experience and physical knowledge. 
Li et al. [24] conducted a study using data from the Qinshan 
300 MWe NPP full-scope simulator. They combined an au- 
tomated feed-forward neural network with optimization algo- 
rithms, which could effectively forecast the steam mass flow 
rate and water temperature during transient reactor operation 
for up to five seconds in advance. Tan et al. [25] established 
a mathematical model to prove the equivalence of simulated 
and operation data when the mean noise distribution is zero. 
This indicates that the simulated data can provide a supple- 
mental dataset for the AI model in the initial training and the- 
oretical analysis. Although various simulators have been de- 


signed to closely mimic the operations of actual reactors, the 
simulated and real data still exhibit certain domain discrepan- 
cies in terms of noise, numerical distributions, and dynamic 
characteristics for the following reasons: 


(1) The mathematical models in simulators are simplified 
from real complex nuclear-power systems and cannot 
fully capture the nuanced physical processes. 


(2) The operating parameters and states of simulators grad- 
ually differ from actual reactors, especially the changes 
in burnup caused by reactor-lifespan variation. 


(3) In certain transient or extreme conditions, the dynamic 
response of simulators may marginally differ from the 
operations of actual reactors. 


Even if real data accurately capture the intricate features of 
environmental interactions in NPPs, the difficulty of data col- 
lection leads to an inability to encompass all possible scenar- 
ios. Data derived from theoretical models and computational 
simulations of nuclear systems are readily available and in- 
herently safer to obtain. Both the simulated and real data have 
unique state characteristics. Consequently, AI models trained 
solely on simulated or real data may be inaccurate and dif- 
ficult to apply in NPPs with high safety and reliability stan- 
dards [26]. Thus, investigating the transfer of prior knowl- 
edge from sufficiently simulated data to scarce actual data is 
essential for enhancing the precision of operating-parameter 
forecasts in NPPs. 


However, this scheme raises an open issue: how well do 
simulated data generalize to real data? Transfer learning, 
a deep-learning technique that aims to leverage pre-existing 
knowledge to improve the performance in a new task or do- 
main, has become a feasible solution [27]. Lin et al. [28] pro- 
posed a transfer-learning model using maximum mean dis- 
crepancy (MMD) and a convolutional neural network (CNN). 
The experimental results demonstrated that transferring prior 
diagnostic knowledge is conducive to expanding the scope 
of nuclear-accident diagnosis in NPPs. Domain adaptation, 
which is a subset of transfer learning, specifically focuses 
on addressing domain discrepancies between the source and 
target domains in scenarios of the same task [29]. For in- 
sufficiently learnable samples, numerous domain-adaptation 
methods have been developed to address situations in me- 
chanical fault diagnosis [30], medical-image analysis [31], 
and robot control [32]. Xiang et al. [33] established a fault- 
diagnosis method using simulations to obtain sufficient faults 
and domain adaptation to transfer the simulated knowledge to 
a real-world diagnosis. This approach not only supplements 
scarce fault samples but also mitigates the gap between sim- 
ulation and reality. Inspired by their work, domain adapta- 
tion theoretically has the potential to transfer knowledge from 
simulation to reality and learn the common feature subspace 
for parameter prediction, wherein simulated and real data are 
considered as the source and target domains, respectively. 
Thus, the effectiveness of domain-adaptation techniques in 


bridging the gap between simulation and reality should be in- 
vestigated, particularly for accurately forecasting critical pa- 
rameters in nuclear reactors. 


Based on the aforementioned discussion, this study aims to 
devise a transferability architecture for forecasting operating 
parameters in nuclear reactors using a simulation-to-reality 
domain adaptation (SRDA) model. Specifically, the SRDA 
model comprises four components: a feature extractor, pa- 
rameter predictor, domain discriminator, and multiple kernel 
maximum mean discrepancy (MK-MMD). The feature ex- 
tractor, as the backbone network within the SRDA model, is 
established using transformers that can capture dynamic char- 
acteristics and temporal dependencies from simulated and 
real data. A parameter predictor containing an improved log- 
arithmic loss function can perform precise forecasting tasks 
under distinct reactor power levels. The domain discrimina- 
tor utilizes an adversarial strategy that forces the feature ex- 
tractor to learn deep domain-invariant features. MK-MMD 
quantifies the discrepancies between simulated and real data 
through sophisticated high-dimensional mapping. The key 
contributions of this study are summarized as follows: 


(1) Unlike conventional methods solely using simulated- 
data modeling, the simulation-to-reality transferability 
model is pioneered by a novel technique in the domain 
adaptation of computer vision to precisely forecast crit- 
ical parameters in nuclear reactors. 


(2) The transformer uses a multi-head attention mechanism 
and is embedded as a feature extractor in the SRDA 
framework to capture both dynamic characteristics and 
temporal dependencies from simulated and real data. 
The improved logarithmic loss function in the predictor 
is refined to adapt varied power levels in reactors. 


(3) The SRDA framework is expertly developed to bridge 
the gap between the simulated and real data and harness 
the strengths of adversarial strategy (i.e., the extraction 
of deep domain-invariant features) with the MK-MMD 
(i.e., the minimization of domain-distribution discrep- 
ancies) simultaneously. 


The remainder of this article is organized as follows. Sec- 
tion II analyzes the differences between the simulated and real 
data. Section II introduces the proposed SRDA framework in 
detail. In Section IV, we validate the precision and superior- 
ity of the proposed method through comparative experiments. 
Finally, Section V concludes the paper. 


Il. PRELIMINARY ANALYSIS 


To visualize the differences between the simulated and real 
data, neutron fluxes at the same height in each of the four 
reactor channels are shown in Fig. 1. The four simulated 
curves of the height coincidence exhibited smooth tendencies 
over a narrower range, whereas the actual curves exhibited 
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Fig. 1. Numerical distributions of the simulated and real data 


obvious noise over a wider range. Discrepancies arise pri- 
marily from model simplifications and potential errors in the 
parameter estimation. For example, in simulating reactor dy- 
namics, certain assumptions must be made for computational 
feasibility, which can lead to deviations from the actual reac- 
tor responses. The dynamic characteristics of an NPP, such 
as its thermal-hydraulic behavior and neutron kinetics, were 
approximated in the simulations. However, these approxima- 
tions can oversimplify real phenomena. In real-world reac- 
tors, neutron-flux signals are collected by ex-core detectors, 
whose signals are primarily induced by neutrons and gamma 
rays, along with a component of electrical noise [34]. In ad- 
dition, burnup changes caused by reactor-lifespan variations 
cause the numerical distributions of the simulation and real- 
ity to shift progressively. These observations underscore the 
limitations of simulations and the complexities of real-world 
nuclear reactors, which further demonstrate the discrepancies 
in noise, numerical distributions, and dynamic characteristics 
between simulations and reality. 


II. METHODOLOGY 


A. Problem formulation 


The operating-parameter data X = {21,22,--- , £r} E€ 
RT are given hypothetically, where T denotes the length of 
the time series. In the prediction task in nuclear reactors, 
a sliding window is applied to construct the dataset D = 
{(X1,Y+)} € R™”, where t and n represent the t-th tempo- 
ral window and total number of windows, respectively. More 
specifically, X; = {£t-w—1;, + , Zt—1, Le} represents the in- 
put window of length w and Y; = {£t41, Lt4+2;,' ° , Etr} 
represents the forecasting window in the future 7 steps 
(T >1). The process can be viewed as a function Fp as fol- 
lows: 
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The prediction task involves transferring knowledge from 
ample simulated data (source domain) to scarce actual data 
(target domain) in NPPs. The simulated data represents the 
source domain DS = {(X?,Y°)} (X? © xS, YS € 
VS), and the real data represents the target domain D? = 
{(X7P,Y7)} (Xf e &7, YP e yT). They share a com- 
mon feature subspace, that is, XS = XT and Y! = YT. Ow- 
ing to the different actuation modes and system dynamics, the 
source and target domains have different marginal probability 
distributions, that is, P(X°) #4 P(X7). Thus, we design 
a transformation function Fp, to fulfill P(Fp4(X°)) = 
P(Fpa(X TYN; Function F'p 4 is combined with function Fp 
to establish the transferability prediction framework Fp = 
Fpa(Fp(-)), which performs forecasting missions in real- 
world nuclear reactors. 


B. Overview of SRDA 


The proposed SRDA framework aims to predict real data 
in nuclear reactors precisely over time by learning and trans- 
ferring prior knowledge from simulations to reality. As pre- 
sented in Fig. 2, the architecture of SRDA resembles that of 
conventional neural networks and possesses two output mod- 
ules instead of one. The SRDA model comprises four mod- 
ules. The blue module is the feature extractor G p and serves 
as the backbone network that directly affects the transfer ef- 
fect. We use a transformer as the feature extractor, which is 
described in detail in Section III D. The orange module, rep- 
resenting the parameter predictor Gp, is constructed using an 
improved logarithmic loss, a fully connected (FC) layer, and 
a prediction output layer, which forecasts the future variation 
of operating parameters in the reactors. The purple module 
represents the domain discriminator Gg and comprises two 
FC layers and a classification output layer. The purpose of 
Ga is to classify the training data from each domain and es- 
tablish an adversarial learning strategy with feature extractor 
Gy. In addition, the green module represents the multiple 
kernel maximum mean discrepancy (MK-MMD) used to esti- 
mate the discrepancies between the simulated and real reactor 
data after the feature extraction. 


C. Principles of domain adversarial strategy 


In the training phase, the samples X; from DS and DT 
are fed into the feature extractor G p to extract temporal fea- 


tures. Then, the obtained features X f are forwarded to both 
the parameter predictor Gp and domain discriminator Gq to 


forecast the operating parameters Y ; and generate the domain 
label d;. This process is expressed as follows: 


XÍ = Gy (X1;95) 
Y¥.=Gp (xf; 0, ) (2) 
d, = Ga (X7;61), 
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Fig. 2. Schematic of the simulation-to-reality domain adaptation (SRDA) framework 


where 0 f, 8,,, and Oq represent the trainable weight matrices 
in Gy, Gp, and Gg, respectively. 


This framework mitigates domain differences and makes 
precise parameter predictions by jointly training G f, Gp, and 
Ga. More specifically, training has two goals: (1) minimiz- 
ing the prediction loss for Gp and (2) maximizing the domain 
loss for Gq simultaneously, such that the domain discrimina- 
tor cannot distinguish the domain from which the obtained 
features originate [35]. Feature extractor Gy and domain 
discriminator G4 are trained adversarially to ensure that G's 
maps the simulated and real data into a common subspace 
and generates domain-invariant features. Consequently, the 
training convergence learns deep domain-invariant features in 
the feature extractor, which refers to temporal dependencies 
or generic patterns that do not significantly change between 
the simulated and real data. To perform adversarial training, 
the feature extractor G p and domain discriminator Gg are in- 
terconnected by a gradient-reversal layer to achieve optimal 
results. For efficient backpropagation, the trade-off loss func- 
tion (Liota1) in the framework is built and formalized as fol- 
lows: 


Ss 


Liota = = STL, ({G [Gy (Xi; 45) ; 8, LY.) 


nant 
-= aint 3 La ({Ga ICs (X1;0,):6a] }, de) 
+ Lyik-mmp 

nS yet 
= SLi (%.¥:) = ree 3 La (åa) 
+ LMK-MMD, 


(3) 


where À denotes the weight coefficient that adjusts the trade- 


off between the predictor, discriminator, and MK-MMD 
losses. n’ and n? are the numbers of samples for training 
from DS and DT, respectively. Lp» La, and Lyx-mmp de- 
note the loss functions of the predictor, discriminator, and 
MK-MMD. When the reactor operates at low power levels, 
the magnitudes of certain operating parameters vary consid- 
erably; thus, the model cannot accurately fit smaller values. 
The improved logarithmic function is specifically designed 
as a predictor loss function £p, and its formula is defined by 
Eq. (4). The cross-entropy function is used as the discrim- 
inator loss function La. Lyx-mmp iS described in detail in 
Section III E. 
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where € is a small constant that ensures that the predictions 
inside the logarithmic function are always positive and is set 
to 0.01. 


Lp, La, and Lyix-mmp have different scales corresponding 
to the losses in prediction, classification, and statistics, re- 
spectively. Thus, a trade-off learning strategy is developed 
for joint training, in which the weight À is adjusted dynami- 
cally and gradually increase from an initial small weight dur- 
ing training. A formal definition of dynamic A; is expressed 
as follows: 


(5) 


A = 1, 6 
1 + exp (—10 x p;) (6) 


where p; represents the learning progress, which increases 
linearly from zero to one. 7 and E denote the 7-th epoch being 


processed and the maximum number of epochs, respectively. 
This strategy ensures that the domain discrimination is less 
affected by noisy data in the initial stages of training. 


In the testing phase, the trained SRDA model utilizes a fea- 
ture extractor (i.e., the transformer) to capture temporal char- 
acteristics from real data, which are then fed into the predictor 
to forecast the operating-parameter variations in a real-world 
reactor. The domain discriminator and MK-MMD modules 
are not involved in the testing phase because their purpose 
is solely to assist the feature extractor in learning domain- 
invariant features during training. 


D. Principles of transformer 


A transformer [36] has an excellent capacity for handling 
time series and is utilized as the feature extractor within the 
SRDA model. A standard transformer has a sequence-to- 
sequence structure that incorporates an encoder and a de- 
coder. To capture the dynamic characteristics and temporal 
dependencies from the simulated and real data, the encoder 
in the transformer is utilized to map the inputs into a high- 
dimensional domain-invariant feature matrix. 
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Fig. 3. Structure of transformer 


As presented in Fig. 3, the encoder comprises positional 
coding, multi-head attention, layer normalization, and a feed- 
forward neural network. The positional information is cal- 
culated using sine and cosine functions [37]. Multi-head at- 
tention aims to capture the dynamic characteristics of special 
events that can enhance the sensitivity of the model to criti- 
cal moments or transient scenarios in reactors. As shown in 
Fig. 4, multi-head attention, as the basic module in the trans- 
former, first expands the input X; into a new embedding X; 
by an FC layer, which is described as follows: 


t = XW! X, E REXOS, (7) 


where W! is the weight of the input FC layer. k and d are the 
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Fig. 4. Structure of multi-head attention 


head numbers of the attention mechanism and feature dimen- 
sion, respectively. 


Embedding X; is further propagated to multiple heads 
where the weights are not shared among them. Each head 
has three FC layers and a scaled dot-product attention. The 
FC layers are employed to map X; into a query (Q € R**%), 
key (K e€ R**4), and value (V € R**4), which are ex- 
pressed as follows: 


Q, K, V = X,W9, X,W*, XW", (8) 
where W, W*, and W” are the weights of the FC layers. 


The scaled dot-product attention can calculate the correla- 
tion between Q and K to produce an attention map, which 
is employed as the weight of V; the calculation of which is 
described formulas follows: 


QK” 
vd 


where F's is the mapping function of the scaled dot-product 
attention. o(-) denotes a Softmax activation function. 


Fs(Q,K,V)=0 V, (9) 


The output of each head is concatenated and calculated us- 
ing an output FC layer. This process can be simplified as 
follows: 


Si = Fs (XW? XWK, XiWY) ie 0k] (10) 


F4(Q,K,V) =7(S1,---;, Sk) WÔ, (11) 


where F', denotes the mapping function of multi-head atten- 
tion. We, WK, and WY represent the weights of the FC 
layers in the 2-th head, whereas w°? represents the weight of 
the final FC layer. y denotes the concatenated operation. 


The feed-forward neural network, consisting of two FC 
layers and a rectified linear unit activation function, primar- 
ily boosts the nonlinear fitting capability of feature extraction, 
which is utilized separately for each position. 


E. Principles of MK-MMD 


A standalone domain adversarial strategy may have subop- 
timal effects or instability in simulation-to-reality knowledge 
transfer. Combining the adversarial strategy with MK-MMD 
can compensate for these shortcomings while promoting the 
stability and robustness of the model. In the SRDA frame- 
work, the dimensionality of the feature matrix X ș obtained 


S 
by the feature extractor is reduced to eigenvectors X f and 


xf, whose MK-MMD is then calculated using Eq. (12). 
LuK-mmp Can quantify the distribution discrepancies between 
simulated and real data using sophisticated high-dimensional 
mapping. Compared to traditional MMD, MK-MMD em- 
ploys a set of kernel functions to fully analyze data across 
different scales and dimensions, which enhances the ability 
to identify distributional discrepancies [38]. 


i= s 1 
LMK-MMD = T? (xf ) = ne 
t=1 


(12) 
p(-) denotes the function that maps the feature to the repro- 
ducing kernel Hilbert space. A kernel function K, which is a 
convex combination of m positive semi-definite kernels K,, 
is defined to avoid a complicated mapping. 


u=1 u=] 


where K, and Bu represent the u-th kernel function defined 
by the Gaussian kernel and its coefficient, respectively. u de- 
notes the number of kernels, which is set to five. 


IV. EXPERIMENTS 


In this section, the proposed SRDA model is evaluated us- 
ing two types of data (simulated and real), which are regarded 
as the source and target domains. Typical operating parame- 
ters are selected for forecasting, including twenty-four neu- 
tron fluxes (N1, +- , No4) and six temperatures (T), +- , Te) 
at different locations in the reactor. Neutron fluxes and tem- 
peratures play critical roles in reactors, as they are essential 
for monitoring the power distributions and levels of the reac- 
tor. As presented in Fig. 5, the neutron fluxes are gathered 
by ex-core neutron detectors (i.e., uncompensated ion cham- 
bers) to generate channel currents at six distinct heights in the 
four channels. The currents are amplified and converted into 
voltage signals. The temperatures recorded by the resistance 
thermometers correspond to the inlet and outlet temperatures 
in the three primary loops. 


The simulated data with a one-second sampling interval are 
produced by the full-scope simulator of a pressurized water 
reactor (PWR), which is meticulously designed to match the 
actual control station of an NPP, ensuring that every compo- 


Pressure 
vessel 


Ex-core neutron 
_ detector 


Inlet pip 


Thermometer 


Fig. 5. Monitoring devices for neutron fluxes and temperatures 


nent and system are precisely simulated. All critical nuclear- 
reactor systems such as the reactor core, cooling systems, 
control systems, and emergency-response systems are inte- 
grated into the simulator to provide a comprehensive simula- 
tion environment. To extract abundant information and trans- 
fer knowledge, a full-scope simulator produces various transi- 
tory data under varying power. The actual data originate from 
a real-world digital instrumentation and the control system in 
a PWR. 


Nuclear reactors mostly operate in a steady state, owing to 
their operating characteristics. Transient operation rarely oc- 
curs, except when it is caused by external factors such as grid 
peaking, shutdowns, and faults. To demonstrate the reliability 
of the results, the target domain data contains two sets: 37,000 
samples of shutdown data and 60,000 samples of power vari- 
ation with one-second and ten-second sampling intervals, re- 
spectively. In the two transient scenarios, the control rods are 
manipulated to induce perturbations in the three-dimensional 
power distribution, which characterizes the different degrees 
of change in the reactor. In the joint training phase, the train- 
ing set is composed of all source data and the first 5% of the 
target domain data (only steady-state operation). The remain- 
ing target-domain data are used as the test set. For the two 
test sets with different sampling intervals, the past 180 steps 
(3 and 30 min) of the historical data are applied to recursively 
predict the data of the next 60 steps (1 and 10 min, respec- 
tively). 


A. Experimental setup 


Examining the effects of various feature extractors on the 
SRDA model can provide valuable insights. Six representa- 
tive deep-learning networks are applied to explore the gen- 
eralizability of the proposed framework: autoencoder (AE), 
CNN, recurrent neural network (RNN), LSTM, gated recur- 
rent unit (GRU), and temporal convolutional network (TCN). 


Table 1. Parameter settings of various feature extractors 


Model Parameter Value 

AE Neurons of encoder 64 
Neurons of decoder 64 
Number of filters 32 

CNN Filter size 3 

RNN Bi-direction structure True 
Neurons of hidden layer [32,32] 
Bi-direction structure True 

LSTM Neurons of hidden layer [32,32] 

GRU Bi-direction structure True 
Neurons of hidden layer [32,32] 
Number of filters 32 

TCN Number of residual layer 2 
Filter size 13 
Dilated factor [1,2] 

Table 2. Parameter settings of the SRDA model 

Module Parameter Value 
Input length 180 
Output length 60 
Neurons of predictor [32, 64, 60] 

SRDA Neurons of discriminator [32, 16, 16, 2] 


(transferability Number of kernels in MK-MMD 5 


framework) Optimizer Adam 
Epoch 200 
Learning rate 0.001 
Batch size 32 
Number of multi-head 2 
Feature dimension 32 
Transformer 
Number of encoder layers 4 
(feature 
Neurons of feedforward 
extractor) [32, 128, 32] 
neural network 
Dropout 0.1 


The key parameters of each model are listed in Tab. 1. 


In addition to comparing different feature extractors, 
the SRDA model is compared with six advanced domain- 
adaptation methods in parallel to prove its superiority. Ow- 
ing to the limited domain-adaptation methods available for 
forecasting tasks, we modified the existing methods proposed 
for time-series or visual classification. Comparison meth- 
ods include deep domain confusion (DDC) [39], correla- 
tion alignment via domain adaptation (CA-DA) [40], mini- 
mum discrepancy estimation for domain adaptation (MDE- 
DA) [41], a DIRT-T approach to domain-adversarial adap- 
tation (DIRT-T) [42], an adaptive domain-adversarial neu- 
ral network (ADANN) [43], and adversarial spectral-kernel 
matching for domain adaptation (ASKM-DA) [44]. The hy- 
perparameters of the aforementioned approaches are ratio- 
nally set in accordance with corresponding studies to ensure 
fairness. In the training phase, the learning rate and batch 
size in all models are critical parameters adjusted by grid op- 
timization. For the proposed SRDA, the parameter settings 
are listed in Tab. 2. All the AI models are developed using 
PyTorch 2.0.1 in Python version 3.8. 


Three precision metrics, namely the root mean square error 


(RMSE), Mean absolute error (yar), and symmetric mean 
absolute percentage error (ÔsMApE), are adopted to evalu- 
ate the forecasting performance. The smaller the RMSE, 
Omar, and dsmapr metrics, the higher the prediction accu- 
racy. These can be calculated as follows: 


n ~\2 
ÖRMSE = ee (14) 
i 7 
ômar = —) lve — Gel (15) 
t=1 
OSMAPE = : 3 Ive med x 100 (16) 
n = [ye + 9] /2 


where n is the total number of test samples. y; and y are the 
actual and predicted values at time t, resepctively. 


B. Forecasting results 


Experiments on forecasting tasks are conducted using 
source-only, target-only, and source-target models as base- 
lines to validate the effectiveness of the proposed SRDA 
model for knowledge transfer from simulation to reality in 
nuclear reactors. The source-only model is trained exclu- 
sively on the source-domain training set and directly tested 
on the target-domain test set. Similarly, the target-only model 
is trained on the target-domain training set and directly tested 
on the source-domain test set. The source-target model is 
trained on the source-domain data using conventional trans- 
fer learning, and it fine-tunes the weights of its final layer on 
the target-domain data. Furthermore, a transformer and im- 
proved logarithmic loss are used to construct the three base- 
lines mentioned above. Domain adaptation is not employed 
in this process. Forecasting of the neutron flux N; and hot- 
leg temperature T) are performed as examples. As shown 
in Fig. 6, the target-only model trained using the first 5% of 
the steady-operation data exhibits the largest predictive devia- 
tion. After learning a sufficient number of simulated samples, 
the trained source-only model can adapt to the basic varia- 
tional trend of real neutron fluxes and temperatures, resulting 
in suboptimal effects attributable to the difference between 
the simulation and reality. Although the source-target model 
outperforms the above two models, it exhibits a certain de- 
viation in the local area. Compared with the three baselines, 
the predictive trend obtained by the SRDA model with do- 
main adaptation is generally closer to the real curves and free 
from the interference of operational noise in complex nuclear 
systems. 


Tab. 3 provides the specific average and standard deviation 
of the errors (6pmsE. OMAR, and ôsmape) for all the mod- 
els in the two test sets. For neutron-flux forecasts, the SRDA 
model demonstrates superior performance during shutdown, 
with a remarkably low õrmse of 0.010 (V) and dyar of 
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Table 3. Forecast errors of the SRDA model compared with various baselines 
Forecasting Model Shutdown Power variation 
target õrmse(V/°C) Oman(V/°C)  dsmapp/% Ormsp(V/°C) dman(V/°C)  dsmapr/% 
Target-only 0.902+0.688  0.903+0.689 61.549 + 23.977 0.10740.051 0.105+0.049 8.710+4.543 
Source-only 0.034+0.011 0.041+0.011 6.615+1.639 0.035+0.014  0.036+40.014 3.083+1.733 
Neutron flux Source-target 0.028-£0.005 0.030+0.005  5.58741.277 0.026+0.035  0.027+0.035  2.192+3.081 
SRDA* 0.010+0.002 0.012+0.003 1.248+0.287 0.008+0.003 0.009+0.003 0.636+0.178 
Target-only 4.918+4.379 491444377  1.653+1.482 1.246+1.092 = 1.252+1.096 =0.393+0.331 
Source-only 0.825+0.697  0.825+0.696  0.280+0.239 0.35740.134  0.369+0.123 0.118+0.049 
Temperature Source-target 0.7770.749  0.778+0.748 0.264+0.256 0.351+0.140  0.363+0.130 0.116+0.051 
SRDA* 0.113+0.079  0.118+0.085 0.037+0.026 0.223+0.111 0.231+0.110  0.073+0.038 


0.012 (V). Moreover, dsm APE İs an order of magnitude lower the inlet and outlet temperature forecasts, the SRDA model 
than that of its counterparts, at 1.248%. This precision is achieves excellent results, with the lowest ORMSE> Omar, and 
also reflected in the power-variation scenarios, where the dsmMApE Under the shutdown and power-variation conditions. 
SRDA model achieves an dpusp of 0.008 (V) and MAE of In addition, the low standard deviation further highlights the 
0.009 (V), along with a notably low osmAPE Of 0.636%. For stability of the SRDA model. Case experiments demonstrate 
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Fig. 7. Comparison of ôsmape by various feature extractors in the SRDA model 
Table 4. Comparison of forecast errors by various feature extractors in the SRDA model 
Forecasting Feature Shutdown Power variation 
target extractor ôrmse(V/°C) dOman(V/°C)  dsmapE/% dpmsE(V/°C) dOmMan(V/°C) dsmapE/% 
SRDA (AE) 0.023+0.013 0.038+0.044 3.496-+1.791 0.022+0.009 0.028+0.011 1.817+0.942 
SRDA (CNN) 0.026+0.007 0.030+0.010 3.525+1.344 0.019+0.015 0.020+0.015  1.464+1.176 
SRDA (RNN) 0.044+0.035 0.0480.036 4.9023.013 0.014+0.012 0.015+0.013 1.0770.829 
Neutron flux SRDA (LSTM) 0.021+0.013 0.024+0.015  2.563+1.410 0.014+0.008 0.015+0.008 — 1.076-40.589 
SRDA (GRU) 0.024+0.009 0.026+40.009  3.375+1.065 0.013+0.006 0.015+0.006  1.076+0.577 
SRDA (TCN) 0.010+0.003 0.013+0.003 1.3650.280 0.009+0.002 0.010+0.003 0.686+0.198 
SRDA (Trans)* 0.010-0.002 0.012+0.003 1.248+0.287 0.008+0.003 0.009+0.003 0.636+0.178 
SRDA (AE) 0.376+0.148  0.497+0.239  0.123-+0.047 0.70740.365  1.050+0.583 0.226+0.115 
SRDA (CNN) 1.276+0.730  1.628+0.977 0.419+0.238 0.429+0.468 0.513+0.514  0.135-40.143 
SRDA (RNN) 0.300+0.195 0.339+0.264 0.0980.062 1.074+0.982  1.120-+0.981  0.345-£0.305 
Temperature SRDA(LSTM)  0.241+0.094  0.252+0.093 0.079+0.030 0.379+0.197  0.39740.206  0.122+0.063 
SRDA (GRU) 0.256+0.153 0.271+0.154  0.084-+40.047 0.241+0.048 0.258+0.048  0.078+0.015 
SRDA (TCN) 0.177+0.131  0.181Æ0.133  0.058£0.043 0.254+0.179 0.2700.180 0.0830.056 
SRDA (Trans)* 0.113+0.079 0.118+0.085 0.037+0.026 0.223+0.111 0.231+0.110 0.07340.038 


the effectiveness of domain adaptation in transferring knowl- 
edge from simulation to reality in a practical reactor. 


C. Forecasting comparison and analysis 


Temporal feature extraction within the SRDA model is 
interlinked with the operating-parameter prediction perfor- 
mance. To analyze the impact of various feature extractors, 
experiments are conducted to replace the transformer in the 
SRDA framework with the six representative neural networks 
specified in Section IV A: AE, CNN, RNN, LSTM, GRU, and 
TCN. As depicted in Fig. 7, the SRDA framework consis- 
tently achieves favorable outcomes across various feature ex- 
tractors, demonstrating its broad versatility. However, differ- 
ent backbone networks moderately affect the prediction accu- 
racy for neutron fluxes and temperatures during the shutdown 
and power-variation phases. Owing to the lack of inherent ar- 
chitecture in AE and CNN for recording temporal dependen- 
cies, the SRDA (AE) and SRDA (CNN) models exhibit in- 
sufficient feature-extraction capabilities. The SRDA (RNN) 
makes unstable predictions, as reflected in its dgyapp, OW- 
ing to the absence of effective memory mechanisms. The 


SRDA (LSTM) and SRDA (GRU) models incorporate mem- 
ory cells and gating mechanisms to mitigate issues, such as 
vanishing gradients, thereby bolstering the temporal feature- 
extraction process. SRDA (TCN), which incorporates dilated 
causal convolutions with a larger receptive field to capture 
long-term characteristics, exhibits precision comparable to 
that of SRDA (Trans) and is a robust contender. The multi- 
head attention block of the proposed SRDA (Trans) allows 
it to capture both subtle long- and short-term dependencies, 
which makes it superior to other feature extractors in adapting 
to complex variations. The experimental results are demon- 
strated by the steady OsmAPr in the forecasting under reactor- 
shutdown and power-variation conditions. 


Tab. 4 presents a detailed comparison of the predictive 
precision across various feature extractors for the two test 
sets. In summary, the models intricately designed for time- 
series analysis, such as TCN and the transformer within 
the SRDA framework, possess advanced temporal feature- 
extraction capabilities. This facilitates more effective domain 
adaptation, resulting in enhanced predictive performance. Al- 
though SRDA (TCN) yields formidable and competitive re- 
sults, SRDA (Trans) demonstrates unparalleled performance 
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Fig. 8. Comparison of forecast errors by various domain-adaptation methods 


for both neutron fluxes and temperatures. For example, 
in inlet- and outlet-temperature prediction, compared with 
the average errors (ÔRMSE, OMAR, and dgsmapr) of SRDA 
(TCN), SRDA (Trans) is improved by 36.158%, 34.807%, 
and 36.207% in shutdown, as well as by 12.205%, 14.444%, 
and 12.048% during power variation, respectively, further 
showcasing the superiority of the transformer for capturing 
temporal dependencies in complex, dynamic nuclear-reactor 
systems. 


To demonstrate the knowledge-transfer superiority of the 
proposed method from simulation to reality, the SRDA model 
is compared with the six advanced domain-adaptation meth- 
ods specified in Section IV A, namely the DDC, CA-DA, 
MDE-DA, DIRT-T, ADANN, and ASKM-DA models. To 
ensure a fair evaluation, the transformer and improved log- 
arithm losses are applied in the aforementioned methods. 
As presented in Fig. 8, the DDC and CA-DA models are 
statistic-based domain adaptations, and are designed to mit- 
igate the distribution discrepancies between the simulated 
and real features. Their knowledge-transfer ability for neu- 
tron fluxes and temperatures during shutdown is inadequate, 
leading to larger errors. MDE-DA is a composite method 


that improves the predictive stability on the two test sets 
by fusing second-order statistics in CA-DA with MMD in 
DDC. With the incorporation of conditional entropy and a 
teacher model, the adversarial-based DIRT-T approach effec- 
tively forces the feature extractor to align domain-invariant 
features. This methodology demonstrates moderate perfor- 
mance in terms of inlet- and outlet-temperature predictions. 
In contrast, ADANN and ASKM-DA, specially designed for 
time-series-analysis tasks, exhibit more refined accuracy be- 
cause of their marginally smaller errors compared to DIRT- 
T. Based on Fig. 8, adversarial-based methods tend to per- 
form consistently better, thereby enhancing the generaliza- 
tion from simulation to reality. The finesse of the SRDA 
model involves adopting a domain adversarial strategy to ex- 
tract deep domain-invariant features between the simulated 
and real data, in addition to utilizing MK-MMD to mitigate 
their distribution discrepancies. This dual approach guaran- 
tees that the model maintains the essential characteristics of 
the simulation while adjusting to the distribution and fluctu- 
ations that exist in a real-world reactor, which considerably 
improves its forecast precision. 


D. Model interpretation 


The proposed trade-off loss function includes the predic- 
tor Lp, discriminator La, and MK-MMD losses LmK-MMD- 
Because £4 and £yyx-mmp have been proven to enhance per- 
formance in Section IV C, we focus on investigating the con- 
tribution of £p to the prediction precision. £p acts as an im- 
proved logarithmic loss and is responsible for adapting the 
predictor losses under different power levels in nuclear re- 
actors. We compared the logarithmic loss with conventional 
mean square error (MSE) loss. An example (only six layers 
of neutron flux in one channel are shown) of the results for 
the neutron-flux forecasting task is shownin F ig.9 . The 
SRDA model using the MSE loss achieves qualified predic- 
tions for the SRDA model, with only a marginal divergence 
from the actual values observed at lower reactor-power lev- 
els. This deviation can be attributed to situations in which 
the MSE loss changes significantly with data scaling, making 
the model less sensitive at low powers, and the value is close 
to zero. The proposed SRDA model exhibits superior pre- 
dictive accuracy for different power operations by balancing 
different numerical scales. Enhanced precision is critical for 
fine-tuned a pplications in nuclear-reactor o perations, where 
minute deviations can have significant implications. 


To present the transferability effect of the domain adap- 
tation intuitively, feature distributions are visualized using 
t-distributed stochastic embedding (t-SNE). t-SNE is a non- 
linear dimensionality-reduction algorithm that can transform 
a high-dimensional feature matrix into a two-dimensional 
eigenvector for visualization. As depicted in Fig. 10, the 
two domains are color-coded, with red denoting the simu- 
lated data and blue denoting the real data. In detail, Fig. 10(a) 
and Fig. 10(c) show the feature distributions without domain 
adaptation for both datasets. The features of the two domains 
only overlap locally, illustrating the similarities and discrep- 
ancies that commonly exist between simulation and reality. 
Thus, directly applying a model trained on simulated data to 
real data results in unsatisfactory forecasting results owing 
to a domain shift. In contrast, Fig. 10(b) and Fig. 10(d) 
show the feature distribution after the feature extraction in 
the SRDA model. Notably, after the domain adaptation, the 
distributions of the extracted features from the simulation and 
reality are uniformly mixed, illustrating that the SRDA model 
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can mitigate the domain discrepancies effectively to enhance 
the operating-parameter prediction in reactors. 


V. CONCLUSION 


Simulators imperfectly emulate reality in NPPs due to their 
different actuation modes and system dynamics. This study 
aimed to mitigate the discrepancies in noise, numerical distri- 
butions, and dynamic characteristics between simulated and 
real data. A novel transferability framework, called the SRDA 
model, was proposed for forecasting critical parameters in nu- 
clear reactors. The SRDA framework comprised a feature ex- 
tractor, parameter predictor, domain discriminator, and MK- 
MMD. Relative to several advanced domain-adaptation meth- 
ods, the results indicated that the SRDA model demonstrates 
superior knowledge transfer by leveraging ample simulated 
and finite real data. The transformer-based feature extractor 
adeptly captured the dynamic characteristics and temporal de- 
pendencies in transient conditions, such as reactor shutdown 
and power variation, as evidenced by comparisons with vari- 
ous feature extractors. The improved logarithmic loss within 
the predictor was conducive to enhancing forecasting preci- 
sion at various power levels. Furthermore, the integration 
of the domain adversarial strategy and MK-MMD effectively 
adapted to the distributions and fluctuations in real-world re- 
actors while retaining the essential characteristics in the simu- 
lation. Considering the significant impact of different feature 
extractors, the versatility of the SRDA model enables the sub- 
stitution of backbone networks tailored to specific scenarios 
in NPPs, which is another intriguing problem. 
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